A Path-based Relational RDF Database

نویسندگان

  • Akiyoshi Matono
  • Toshiyuki Amagasa
  • Masatoshi Yoshikawa
  • Shunsuke Uemura
چکیده

We propose a path-based scheme for storage and retrieval of RDF data using a relational database. The Semantic Web is much anticipated as the nextgeneration web where high-level processing of web resources are enabled by underlying metadata described in RDF format. A typical application of RDF is to describe ontologies or dictionaries, but in such applications, the size of RDF data is large. As largesize RDF data are emerging and their number is increasing, RDF databases that can manage large-size RDF data are becoming ever more important. To date, some RDF databases have already been proposed; however, they have critical problems: the performance of path queries is insufficient and they cannot discriminate between schema data and instance data. In this paper, as a solution to these problems, we propose a path-based relation RDF database. In our approach, we first divide the RDF graph into subgraphs, and then store each subgraph by applicable techniques into distinct relational tables. More precisely, all classes and properties are extracted from RDF schema data, and all resources are also extracted from RDF data. Each is assigned an identifier and a path expression, and stored in corresponding relational table. Because our proposed scheme retains schema information and path expressions of each resource, unlike most conventional RDF databases, it is possible to process path-based queries efficiently and store RDF instance data without schema information. The effectiveness of this approach is demonstrated through several experiments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Path discovery by Querying the federation of Relational Database and RDF Graph

The class of queries for detecting path is an important as those can extract implicit binary relations over the nodes of input graphs. Most of the path querying languages used by the RDF community, like property paths in W3C SPARQL 1.1 and nested regular expressions in nSPARQL are based on the regular expressions. Federated queries allow for combining graph patterns and relational database that...

متن کامل

Comparing path-based and vertically-partitioned RDF databases

Given the increasing prevalence of RDF data formats for storing and sharing data on the Semantic Web, efficient storage mechanisms for RDF data are also becoming increasingly important. We survey existing storage solutions for RDF data in an RDMS. Two recent and novel storage concepts open the door for significantly better querying efficiency. The first, proposed by Matono, et al (2005), models...

متن کامل

Managing Large Scale Native RDF Semantic Repository from the Graph Model Perspective

We propose a set of solutions for managing a large scale RDF semantic repository from the perspective of RDF graph model. A native storage instead of relational database is used to hold RDF. Indices supporting regular path expression, full-text retrieval and partial OWL Lite inference are built above the storage model. Semantic ranking for resources are provided as well.

متن کامل

Ultrawrap Mapper: A Semi-Automatic Relational Database to RDF (RDB2RDF) Mapping Tool

In this demo, we will show the operation of Ultrawrap Mapper, a semi-automatic software for creating mappings from Relational Databases to RDF in the R2RML language. In 2012, the W3C ratified two related standards for mapping relational database contents to RDF: the Direct Mapping [1] and R2RML [2]. The Direct Mapping is a default mapping of relational data to RDF. The organization and content ...

متن کامل

RP-Filter: A Path-Based Triple Filtering Method for Efficient SPARQL Query Processing

With the rapid increase of RDF data, the SPARQL query processing has received much attention. Currently, most RDF databases store RDF data in a relational table called triple table and carry out several join operations on the triple tables for SPARQL query processing. However, the execution plans with many joins might be inefficient due to a large amount of intermediate data being passed betwee...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005